On the Surprising Behavior of Distance Metrics in High Dimensional Spaces

Authors

  • Charu C. Aggarwal
  • Alexander Hinneburg
  • Daniel A. Keim
Abstract

In recent years, the effect of the curse of high dimensionality has been studied in great detail on several problems such as clustering, nearest neighbor search, and indexing. In high dimensional space the data becomes sparse, and traditional indexing and algorithmic techniques fail from an efficiency and/or effectiveness perspective. Recent research results show that in high dimensional space, the concept of proximity, distance or nearest neighbor may not even be qualitatively meaningful. In this paper, we view the dimensionality curse from the point of view of the distance metrics which are used to measure the similarity between objects. We specifically examine the behavior of the commonly used L_k norm and show that the problem of meaningfulness in high dimensionality is sensitive to the value of k. For example, this means that the Manhattan distance metric (L_1 norm) is consistently preferable to the Euclidean distance metric (L_2 norm) for high dimensional data mining applications. Using the intuition derived from our analysis, we introduce and examine a natural extension of the L_k norm to fractional distance metrics. We show that the fractional distance metric provides more meaningful results from both the theoretical and the empirical perspective. The results show that fractional distance metrics can significantly improve the effectiveness of standard clustering algorithms such as the k-means algorithm.
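
To make the L_k family concrete, the following is a minimal sketch in Python (assuming NumPy is available). The function names `lk_distance` and `relative_contrast` are illustrative and do not come from the abstract. The contrast measure (Dmax - Dmin) / Dmin is an assumption of this sketch: it is one way to quantify how well the nearest and farthest points are separated, and is not necessarily the exact quantity analyzed by the authors. Note that for 0 < k < 1 the triangle inequality no longer holds, so a fractional "metric" is not a metric in the strict sense.

```python
import numpy as np


def lk_distance(x, y, k):
    """Return (sum_i |x_i - y_i|^k)^(1/k).

    k = 1 gives the Manhattan distance, k = 2 the Euclidean distance,
    and 0 < k < 1 gives a fractional distance.
    """
    if k <= 0:
        raise ValueError("k must be positive")
    diff = np.abs(np.asarray(x, dtype=float) - np.asarray(y, dtype=float))
    return float(np.sum(diff ** k) ** (1.0 / k))


def relative_contrast(points, query, k):
    """Return (Dmax - Dmin) / Dmin over the L_k distances from query to points.

    Illustrative measure only (an assumption of this sketch); larger values
    mean the nearest and farthest points are easier to tell apart.
    Assumes no point coincides with the query, so Dmin > 0.
    """
    d = np.array([lk_distance(p, query, k) for p in points])
    return float((d.max() - d.min()) / d.min())


if __name__ == "__main__":
    # Hypothetical experiment: uniform random data in 100 dimensions.
    rng = np.random.default_rng(0)
    data = rng.random((1000, 100))
    origin = np.zeros(100)
    for k in (0.5, 1.0, 2.0):
        print(k, relative_contrast(data, origin, k))
```

Running the script prints the contrast for k = 0.5, 1 and 2 on synthetic data, which gives a quick way to compare the norms on a data set of one's own.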

Similar resources

On 5-dimensional 2-step homogeneous randers nilmanifolds of Douglas type

In this paper we first obtain the non-Riemannian Randers metrics of Douglas type on two-step homogeneous nilmanifolds of dimension five. Then we explicitly give the flag curvature formulae and the $S$-curvature formulae for the Randers metrics of Douglas type on these spaces. Moreover, we prove that the only simply connected five-dimensional two-step homogeneous Randers nilmanifolds of D...

Fixed point of generalized contractive maps on S^{JS}- metric spaces with two metrics

In this paper we prove fixed point existence theorems for Z-contractive maps, Geraghty type contractive maps and interpolative Hardy-Rogers type contractive mappings in the setting of S^{JS}-metric spaces with two metrics. Examples are constructed to highlight the significance of the newly obtained results.

Einstein structures on four-dimensional nutral Lie groups

When Einstein was thinking about the theory of general relativity based on the elimination of special relativity constraints (especially the geometric relationship of space and time), he understood that the first limitation of special relativity is ignoring changes over time. Because in special relativity, only the curvature of the space was considered. Therefore, tensor calculations should be to...

Extended graphs based on KM-fuzzy metric spaces

This paper applies the concept of KM-fuzzy metric spaces and introduces a novel concept of KM-fuzzy metric graphs based on KM-fuzzy metric spaces. This study investigates the finite KM-fuzzy metric spaces with respect to metrics and KM-fuzzy metrics and constructs KM-fuzzy metric spaces on any given non-empty sets. It tries to extend the concept of KM-fuzzy metric spaces to a larger ...

Publication date: 2001